Decision tree based tone modeling with corrective feedbacks for automatic Mandarin tone assessment
نویسندگان
چکیده
We propose a novel decision tree based approach to Mandarin tone assessment. In most conventional computer assisted pronunciation training (CAPT) scenarios a tone production template is prepared as a reference with only numeric scores as feedbacks for tone learning. In contrast decision trees trained with an annotated tone-balanced corpus make use of a collection of questions related to important cues in categories of tone production. By traversing the corresponding paths and nodes associated with a test utterance a sequence of corrective comments can be generated to guide the learner for potential improvement. Therefore a detailed pronunciation indication or a comparison between two paths can be provided to learners which are usually unavailable in score-based CAPT systems.
منابع مشابه
Decision tree based Mandarin tone model and its application to speech recognition
Tone is an essential language phenomenon for Mandarin Chinese language. Until now, we still do not know exactly how context affects tone pattern variation in continuous Mandarin speech. In this paper, we proposed a decision tree based approach to obtain the quantitative result of tone pattern variation in continuous Mandarin speech. Many possible factors other than tone of neighboring syllables...
متن کاملUpdate progress of Sinohear: advanced Mandarin LVCSR system at NLPR
NLPR has been with long efforts on Mandarin speech recognition. This paper reports our recent process in this field with several significant novel characteristics: 1) Very large speech databases are used to learn more robust acoustic model; 2) Acoustic model has evolved from non-tonal class-triphone to tonal class-triphone based on tone-embedded decision tree, namely unified tone & triphone mod...
متن کاملIncorporating Pitch Features for Tone Modeling in Automatic Recognition of Mandarin Chinese
Tone plays a fundamental role in Mandarin Chinese, as it plays a lexical role in determining the meanings of words in spoken Mandarin. For example, these two sentences R R (I like horses) and R M (I like to scold) differ only in the tone carried by the last syllable. Thus, the inclusion of tone-related information through analysis of pitch data should improve the performance of automatic speech...
متن کاملA stochastic polynomial tone model for continuous Mandarin speech
In this paper, a stochastic polynomial tone model is presented for tone modeling in continuous mandarin speech. In this model, the pitch contour is described by a stochastic trajectory. The mean trajectory is represented by a polynomial function of normalized time while the variance is time varying. After that, an effective training and recognition algorithm is developed respectively. Also the ...
متن کاملCombined Use of Speaker- and Tone-Normalized Pitch Reset with Pause Duration for Automatic Story Segmentation in Mandarin Broadcast News
This paper investigates the combined use of pause duration and pitch reset for automatic story segmentation in Mandarin broadcast news. Analysis shows that story boundaries cannot be clearly discriminated from utterance boundaries by speaker-normalized pitch reset due to its large variations across different syllable tone pairs. Instead, speakerand tonenormalized pitch reset can provide a clear...
متن کامل